Introducing New Mechanism in the Learning Process of Fdica-based Speech Separation
نویسندگان
چکیده
The blind source separation for speech using frequencydomain independent component analysis(FDICA) is considered. As a source separation system, Saruwatari et al.[1] proposed a method by integrating the independent component analysis (ICA) and array signal processing. In this paper, we introduce the following two techniques into the learning process of the method[1]. (1)Classification of acquired array signals with respect to the number of speakers. (2)Direction-of-Arrival(DOA) estimation for each speaker using the intervals(frames) which are classified into singlespeaker frame. Through some experiments, we can confirm that these techniques are effective to guarantee the convergence to the global optimal solution in the learning process.
منابع مشابه
Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations
This paper describes two new algorithms for blind source separation (BSS) based on frequency-domain independent component analysis (FDICA). One is FDICA with prefiltering by a speech sub-band passing filter to slow down the learning speed in low signal-to-noise ratio (SNR) sub-bands. The other is FDICA with sub-band selection learning to reduce the number of iterations for those sub-bands. The ...
متن کاملDoctoral Dissertation Blind Source Separation Based on Multistage Independent Component Analysis
A hands-free speech recognition system and a hands-free telecommunication system are essential for realizing an intuitive, unconstrained, and stress-free human-machine interface. In real acoustic environments, however, the speech recognition performance and a speech recording performance significantly degraded because we cannot detect the user’s speech with a high signal-to-noise ratio (SNR) ow...
متن کاملMultistage Ica for Blind Source Separation of Real Acoustic Convolutive Mixture
We propose a new algorithm for blind source separation (BSS), in which frequency-domain independent component analysis (FDICA) and time-domain ICA (TDICA) are combined to achieve a superior source-separation performance under reverberant conditions. Generally speaking, conventional TDICA fails to separate source signals under heavily reverberant conditions because of the low convergence in the ...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملOverdetermined Blind Separation of Acoustic Signals Based on MISO-Constrained Frequency-Domain ICA
We propose a new overdetermined blind source separation (BSS) using frequency-domain independent component analysis (FDICA) based on multiple-input singleoutput (MISO) constraint. To achieve a superior separation performance under reverberant environments, we set the number of microphones to be larger than that of sources. This leads to alternative problems in which the sound qualities of the s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003